Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Goose is not extracting article whole text #278

Open
AgoloAhmedElhady opened this issue Jul 3, 2018 · 0 comments
Open

Goose is not extracting article whole text #278

AgoloAhmedElhady opened this issue Jul 3, 2018 · 0 comments

Comments

@AgoloAhmedElhady
Copy link

Brief

I am facing an issue with goose extraction, some times it stops and return just part of the text with no logical reason. I mean the below article example, goose stopped at part of the article that is not followed by anything irregular in the html, it was just a new <p> so did anyone face this issue can give us any workarounds or at least an explanation on why this is happening and whether or not I will need to look for other alternatives.

Python Code

    g = Goose()
    article = g.extract(url=url)
    return send_request(article.cleaned_text.encode('utf-8'), 'text/raw')

example article url

As Trump Escalates Trade Fight, China Can Take the Hit

Output:

SHANGHAI — Thanks to President Trump 's tariffs, Americans will soon be paying more for a wide variety of Chinese-made goods, and some American customers may end up buying from other countries instead.

For now, China can live with that.

The tariffs the White House announced on Friday will have little immediate impact on China , despite the size of the $50 billion in goods involved and the invective the move set off from Chinese official news media . Mr. Trump's tariffs are ultimately too small and narrowly targeted to seriously impact China's nearly $13 trillion economy, which no longer depends so much on exports and can easily find other places besides the United States to sell its products. In some ways, they are even smaller than tariffs imposed by previous presidents.

The tariffs could spread, of course. The United States has threatened to impose tariffs on $100 billion more in Chinese-made goods and could theoretically hit more than $500 billion in products, the total amount that Americans buy from China. China could retaliate with its own tariffs on the United States' far smaller exports in the other direction across the Pacific, plus impose punitive measures against American companies doing business here.

Any measures carry the risk that they could disrupt the global supply chain in sudden and unexpected ways, or could damage confidence among investors in building factories and other businesses in either country. Already there are sign of strains in the global economy from the broader trade tensions, weakness that China and the United States are both better positioned to weather than other nations.

Thank you very much in advance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant